Our data consists of results from the POPULATION STUDY and the SURVEY.



The POPULATION STUDY has the following data files associated with it:

1. Population_study_raw_data_anonymized.csv — A CSV file of the population study results. 

Names and e-mails of individual faculty members were removed and replaced by random number identifiers. Columns in the CSV file are: random number identifier for the individual faculty member, faculty member’s department, faculty member’s broad discipline, faculty member’s rank, number of articles submitted by the faculty member in compliance with the Open Access Policy, total number of full-text articles uploaded by the faculty member to ResearchGate, number of full-text articles published in 2013 or later uploaded by the faculty member to ResearchGate, and whether the faculty member has a profile on ResearchGate. Values for having a profile on ResearchGate are “No”, “Yes”, and “Yes (author profile?)”. “Yes (author profile?)” indicates a suspicion that the profile was auto-generated by ResearchGate and not created by the researcher. 






The SURVEY, which was administered via SurveyMonkey, has the following data files associated with it:

1. Survey_questions.pdf - A PDF file, generated by SurveyMonkey, of the complete survey, including questions asked and all possible responses displayed to respondents for each question. 
 

2. Survey_all_summary.pdf - A PDF file, generated by SurveyMonkey, of the responses to the survey in an easy-to-read format, with graphics. Includes the number of respondents who answered each question, the number and percentage of respondents who selected each possible response, and all free-text comments. Note that certain possible responses for questions 2, 3, 7, 8, 12, 16, and 18 were intentionally not displayed to respondents (and thus could not be selected) but are reported in this file as having 0 responses. For the most accurate list of questions and possible responses, see Survey_questions.pdf. For complete data on all responses received, see Survey_full_responses_coded.csv. 


3. Survey_full_responses_coded.csv — A CSV file of the complete responses to the survey from each of the 135 respondents (identified by RespondentID). The data in this file contains the full set of responses collected in 2016 to the survey “Faculty Survey on URI Open Access Policy and ResearchGate”. The raw data generated by SurveyMonkey has been coded to facilitate data analysis. See Survey_full_responses_codebook.txt for a guide to the data variables.


4. Survey_full_responses_codebook.txt — A TXT file to accompany Survey_full_responses_coded.csv. Identifies each question by question number, whether the question was required, whether multiple responses were allowed, the number of respondents, and the full set of possible responses with the variable name for each response.


5. Q2_&_Q12_coded.txt — A TXT file indicating which of the answer choices for Question 2 and Question 12 are true and false, used to evaluate respondents’ actual knowledge of the Open Access Policy and ResearchGate in the statistical analysis.


6. Survey_statistical_analysis.R — An R script file quantifying certain survey questions. Multiple linear regression models are then applied to estimate relations between the responses and faculty participation in the Open Access Policy and submitting full-text articles to ResearchGate.


